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SUBSPACES WITH EXTRA INVARIANCE NEAREST TO OBSERVED DATA. 


C. CABRELLI AND C. A. MOSQUERA 


Abstract. Given an arbitrary finite set of data T = {/i, ... ,/ m } C L 2 (R d ) we prove the existence 
and show how to construct a “small shift invariant space” that is “closest” to the data T over certain 
class of closed subspaces of L 2 (R d ). The approximating subspace is required to have extra-invariance 
properties, that is to be invariant under translations by a prefixed additive subgroup of M. d containing 
7j d . This is important for example in situations where we need to deal with jitter error of the data. Here 
small means that our solution subspace should be generated by the integer translates of a small number 
of generators. An expression for the error in terms of the data is provided and we construct a Parseval 
frame for the optimal space. 

We also consider the problem of approximating T from generalized Paley-Wiener spaces of R d that 
are generated by the integer translates of a finite number of functions. That is finitely generated shift 
invariant spaces that are translation invariant. We characterize these spaces in terms of multi-tile sets 
of R d , and show the connections with recent results on Riesz basis of exponentials on bounded sets of 
R d . Finally we study the discrete case for our approximacion problem. 


1. Introduction 

Let H be a Hilbert space, C a class of closed subspaces of % and T = {/i,... ,/ m } a finite set of 
elements in %. 

In this article we study the existence and show how to construct an optimal subspace S in the class 
C that minimizes the distance to the given data J 7 , in the sense that S minimizes the functional ^(J 7 , S) 
over C. The functional is defined as 

m 

(1) n(T,S)='£\\f j -P s fj\\ 2 , 

j =i 

where Ps denotes the orthogonal projection on the subspace S. 

The motivation to find an optimal subspace in C is, that in many situations one wants to choose a 
model for a certain class of data. Instead of imposing some conditions on the data to fit some known 
model, the idea is to define a large class of subspaces convenient for the application at hand, and find 
from there the one that “best fits” the data under study. 

The signals that need to be modelled are ideally low dimensional but living in a high dimensional 
space. However, since in applications they are often corrupted by noise, they become high dimensional, 
however they are close to a low dimensional subspace, which is the space one seeks. 

When the Hilbert space is L 2 (R d ) it is natural to consider as a model for our data the class of shift 
invariant spaces (SIS), that is, closed subspaces of L 2 (R d ) that are invariant under translations by integers. 
These spaces have been used in approximation theory, harmonic analysis, wavelet theory, sampling theory 
and signal processing (see, e.g., [51 H71I2U1122] and references therein). Often, in applications, it is assumed 
that the signals under study belong to some shift invariant space V generated by the translations of a 
finite set of functions $ = {< p\ , • • • , y> m }, i.e., V = 5'(<f>) = span {Tkipi : k £ Z d , i = 1, ■ ■ ■ , m}. 
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The choice of the particular finitely generated shift invariant space typically is not deduced from a set 
of signals. For example in sampling theory, a classical assumption is that the signals to be sampled are 
band-limited, that is, they belong to the shift invariant space V generated by ip(x) = sinc(x). However, 
the band-limited assumption is not very realistic in many applications. Thus, it is natural to search for 
a finitely generated shift invariant space that is nearest to a set of some observed data. 

In this paper we study the case when T~L = L 2 {R d ). For this case, we restrict the class of approximating 
subspaces to be shift invariant spaces that have extra-invariance, that is: If M is a subgroup of R d such 
that Z d C M we will say that S(ip 1 , • • • , (f m ) is M extra-invariant if 

spanjTfcyjj : j = 1,...,m, k £ Z d j = spa n{T a (f>j : j = 1,... ,m, a € M}. 

Therefore, the space S(ipi, • ■ ■ , <p m ) is invariant under translates other than the integers, even though it 
is generated by the integer translates of a finite set of functions. Such spaces with extra-invariance are 
important in applications specially in those where the jitter error is an issue. 

We first consider the case when the subgroup M is a proper subgroup of K. d that contains Z d . For 
that case we obtain one of the main contributions of this paper. We prove that for any finite set of data 
T = {/i,..., f m } C L 2 (R d ), for any proper subgroup M containing Z d and for any £ € N there always 
exists a SIS V of length at most £ with extra invariance M whose distance (in the sense of JTJ)) to the data 
T is the smallest possible among all the SIS of length smaller or equal than £ that are M extra-invariant. 
(Here, the length of a SIS is the cardinal of the smallest set of generators). 

We construct a solution V and provide a set of generators whose integer translates form a tight frame 
of V. An expression for the exact value of the error E{T 1 V ) between the data and the optimal subspace 
is also obtained using the eigenvalues of some special matrix. 

Next, we consider the approximation problem for the class of generalized Paley-Wiener spaces. Given 
a measurable set H C (not necessarily bounded), the generalized Paley-Wiener space PWq associate 
to fl is the subspace of L 2 (R d ) corresponding to the functions whose Fourier transform vanished outside 
il. A generalized Paley-Wiener space is always invariant under translations by the whole group R d . In 
particular is a SIS of L 2 (R d ) that not necessarily need to be finitely generated. Under the hypothesis 
that PWq has a Riesz basis of integer translates, we proved that PWq is finitely generated if and only if 
H is a multi-tile, (see Proposition 14.31) . That is H is a multi-tile if and only if PWq has extra invariance 
M = R d . 

We study our approximation problem for those generalized Paley-Wiener spaces. We describe for this 
case how to construct a set of generators and show an interesting connection with recent results about 
bases of exponentials. This complete all the cases of extra invariance when M is not a proper subgroup. 

Finally we consider a similar problem when the Hilbert space is £ 2 {Z d ) and C is a conveniently chosen 
class of subspaces. We obtain for this case equivalent results to the ones for L 2 (R d ). The approximation 
problem for the discrete case is related with the continuous case in a very interesting way that is described 
in Section [5] 


1.1. Previous Work. Let us now mention some previous related work. The problem of approximation 
of a set of data by shift invariant spaces (without the extra invariance restriction) started in [2j where 
the authors proved the existence of a minimizer for m over the class of low dimensional subspaces in a 
Hilbert space H and also over the class of shift invariant spaces in L 2 (R d ). 

In [5] the case of multiple subspaces was considered in the finitely dimensional case. That is, the 
authors found a union of low dimensional subspaces that best fits a given set of data in and provided 
an algorithm to find it. In 2011, using dimensional reduction techniques, this algorithm was improved 
(see CP). 

Further, in [7] the authors found necessary and sufficient conditions for the existence of optimal 
subspaces in the general context of Hilbert spaces. However they did not provide a way to construct 
them. 

The first result for approximation of a finite set of data using shift invariant spaces with extra-invariance 
constrains appears in [6], where the authors consider principal shift invariant spaces in one variable and 
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they assume that the space has a generator with orthogonal integer translates, which is a key element in 
their proof. So the techniques of this particular case do not apply to our general case. 

1.2. Organization of the paper. The paper is organized as follows. In Section[2]we set the definitions 
and results that we need about shift invariant spaces, extra-invariance and the approximation problem 
for the case of shift invariant spaces in L 2 (M. d ). 

The main results of the paper are stated and proved in Sections lEO and [5] In Section [3] we present 
the M extra-invariant case for shift invariant spaces, in Section [4] the case of Paley-Wicner spaces and 
finally we consider a discrete case, in Section [5] 

2. PRELIMINARIES 

We begin with a review of the basic results and definitions that will be needed in subsequent sections. 
The known results are generally stated without proofs, but we provide references where the proofs can 
be found. Also, we introduce some of our notational conventions. For the definitions of Riesz bases and 
frames in Hilbert spaces we refer the reader to mug and the references therein. 

2.1. Shift Invariant Spaces. The structure of these spaces has been deeply analyzed (see for example 

Definition 2.1. A closed subspace V C L 2 (M. d ) is said to be a shift invariant space if 

/ G V => T k f £ V, for any k £ Z d , 

where T k is the translation by the vector k £ Z d , i.e. T k f(x) = f(x — k). 

For any subset $ C L 2 (R d ) we define 

S 1 ^) = span{Tfetp: tp £ Q,k £ Z d } and E(&) = {T k ip\ ip £ <f>, k £ Z d }. 

We call S($) the shift invariant space (SIS) generated by <f>. If V = 5($) for some finite set $ we say 
that V is a finitely generated SIS, and a principal SIS if V can be generated by the integer translates of 
a single function. 

For a finitely generated SIS V C L 2 (M. d ) we define the length of V as 

£(V) = min{n £ N: 3 </?i, ■ • • , ip n £ V with V = S(<pi, • • • , <p n )}- 

In addition to the construction of a set of generators of the optimal space for the problems considered in 
this paper, it will be important to estimate the error of these approximations. In order to compute these 
errors we need to consider what is called the Gramian G$ for a family of functions $ C L 2 (M. d ). 

More precisely, given $ = {</q, ■ • • , ip m } a finite collection of functions in L 2 (R d ), the Gramian of 
$ is the m x m matrix of Z d -periodic functions 

(2) [G$(w)]y = + k ) + k )- 

kez d 

The Gramian of $ is determined a.e. by its values at any measurable set of representatives U of the 
quotient R d /Z d and satisfies G$(w)* = G$(w) for a.e. to £U. We will take U = [—1/2, l/2) d . 

For a finitely generated SIS V, we can express the length of V in terms of the Gramian as follows (see 

mmisa) 

(3) £(Y) = essup [rk(G$(w))] 

oj(z.IA 

where rk(H) denotes the rank of a matrix B and $ is a generator set for V. 

One important property of the Gramian is given by the following lemma concerning the measurability 
of the eigenvalues and the existence of measurable eigenvectors of a non-negative matrix with measurable 
entries. 
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Lemma 2.2 (Lemma 2.3.5 of |23] ) . Let G(oo) be an m x m self-adjoint matrix of measurable functions 
defined on a measurable subset E C with eigenvalues Ai(w) > ■ •• > X m (oo). Then the eigenvalues 
i = 1,..., m, are measurable functions on E and there exists an m x m matrix of measurable functions 
U(oo) on E such that U(oo)U*(oo) = I a.e. u> & E and such that 

G(oo) = U(oo)A(oo)U*(oo), a.e. oo £ E, 


where A(w) := diag(X\(oo ),..., A m (w)). 

In pi], Helson introduced range functions and used this notion to completely characterize shift invari¬ 
ant spaces. Later on, several authors have used this framework to describe and characterize frames and 
bases of these spaces. See for example ds nu Ham [ig. We will mention the required definitions and 
some known results that we need later for giving the proofs of our results. We refer to unimmuni for 
a complete description and the proofs. 

Definition 2.3. Let / G L 2 (R d ) and fix U C R d to be a measurable set of representatives of the quotient 
R d /Z d . For oo £U, the fiber rf(oo) of / at w is the sequence 

a/M = {/(w + fc)} fceZ d. 

Here / denotes the Fourier transform of the function /, that is f(oo) = f Rd e~ 2mulx f(x) dx when 
/ G L 1 (M. d ). We observe that if / G L 2 (M. d ), then the fiber rf(oo) belongs to £ 2 (Z d ) for almost every 
uj G IA . 

If V is a finitely generated SIS and wGW we define the fiber space associated to V and uj as follows 

Jv{u) = {t/(w): / G V}, 

where the closure is taken in the norm of £ 2 (Z d ). 

With the above definitions we have: 

Lemma 2.4 (Proposition 5.6 of H). Let V = 5'(4 > ) be a finitely generated SIS. Then 

dim(Jy(w)) = rk (G$(w)), a.e. wGll. 

Lemma 2.5. If f G L 2 (R d ), then 

( i ) the sequence rf{oo) = {f(oo + fc)} fcgZ d is a well-defined sequence in £ 2 {Z d ) a.e. uo G U. 

(: ii ) ||T/(w)||f 2 is a measurable function of to and 

ll/ll 2 = ll/ll 2 = [ \\rf{uo)\\%duo. 

Ju 

Lemma 2.6. Let V be a finitely generated SIS in L 2 (R d ). Then we have 

(i) Jy(w) is a closed subspace of £ 2 (Z d ) for a.e. oo G U. 

(ii) V = {/ G L 2 (M d ) : rf(u>) G Jy(w) for a.e. oo G U}. 

(Hi) For each f G L 2 ( R d ) we have that ||r(Py/)(aj)|j p. is a measurable function of the variable oo and 

T (Pvf) M = P Jv (oj)(rf(oj)). 

(iv) Let ifi,..., ip m G L 2 (M. d ). We have that 

(a) {(^i,..., ipm) is a set of generators of V, if and only if {T(pi(oo ),..., T<p m (uj)} spans Jy(w) 
for a.e. oo £ U. 

(b) The integer translations of ip i,..., p>m o-re a frame (resp. Riesz basis) of V, if and only if 
T<pi(oo),..., Tip m (uj) are a frame (resp. Riesz basis) of Jv(oj) with the same frame (resp. 
Riesz) bounds, for a.e. oo £ U. 
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2.2. Optimality for the class of SIS in L 2 (R d ). In m the authors give a solution for the case where 
the approximation class is the class of SIS in L 2 (M. d ). For this, they reduce the optimization problem into 
an uncountable set of finite dimensional problems in the Hilbert space H = £ 2 (Z d ). 


Theorem 2.7 (Theorem 2.3 of [2]). Let T = , fm} be a set of functions in L 2 (J8L d ). Let 

Ai(w) > ••• > A m (u>) be the eigenvalues of the Gramian Gj^{uj). Then, there exists V* £ V e = 
{V: V is a SIS of length at most £} such that 

m m 

£ II fi - Pv*h\\ 2 < £ II/, - Pv/,ll 2 , Vbev'. 

i= 1 i= 1 


Moreover, we have that 


(1) The eigenvalues Aj(w), 1 < i < m are Z d —periodic, measurable functions in L 2 {U) and the 
approximation error is given by, 


m « 

£{T,l)= £ / Ai(w)dw. 
.• ,,, jw 


i=t+l 


(2) Let 0i(w) = A“ 1 (w) if Xi(co) is different from zero, and zero otherwise. Then, there exists a choice 
of measurable left eigenvectors y 1 (w), • • ■ , Y e (co) with Y l = [y\, ■ ■ ■ , i = 1, • ■ • ,£, associated 
with the first £ largest eigenvalues of such that the functions defined by 


Pi(uj) = 6 i {u)'Y^y)(w)f j (u), i = 1,■■■,£, uj eR d 
. 7=1 


are in L 2 (WL d ). Furthermore, the corresponding set of functions d> = is a generator 

set for the optimal subspace V* and the set — k), k € h d , i = 1, •■•,£} is a Parseval frame 
for V*. 


2.3. Extra invariance. We will need some definitions and known results concerning extra-invariance 
for shift invariant spaces. These are described in this subsection. 

Definition 2.8. Let V C L 2 (R d ) be a SIS. We define the invariance set as follows 

M :={xeR d :T x f GV, V/6b}. 

In [3] (see also 0). the authors proved that the invariance set of a shift invariance space V C L 2 (J8L d ) 
is a closed additive subgroup of that contains Z d . For instance, in the case of the line the invariant 
set of a shift invariant space could be Z, ^Z for some n £ N or R. 

Definition 2.9. Let $ C L 2 (K d ). We will say that V = 5(3>) is M extra-invariant if T m f £ V for all 
m £ M and for all / £ V. 

If M = then the space V is translation invariant but generated by the integer translates of the set 
<F 


One example of a translation invariant space in R is the Paley-Wiener space of functions that are 
bandlimited to [—1/2,1/2] defined by 

PW = {f£ L 2 (R ): supp (7) c [-1/2,1/2]}. 

It is easy to prove that for a measurable set SlcM/ the space 
(4) Ho := {/ € L 2 (R d ): supp(/) C H} 

is translation invariant. Moreover, Wiener’s theorem (see El) proves that any closed translation invariant 
subspace of L 2 ( R d ) is of the form fl])- 

If V is a shift invariant space of length £ and M is an additive subgroup of R d containing Z d , we will 
say that V has extra-invariance M if V is M— invariant. Note that in this case, if $ is a set of generators 
of V, i.e. V = S(&), then 


5'( ( 1 > ) = span{T/</: 4> £ •!>, k £ Z d } = span{T a (f: (j> £ <f>,o! £ M}. 
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In 0] the authors characterize those shift invariant spaces V C L 2 (R) that have extra-invariance. They 
show that either V is translation invariant, or there exists a maximum positive integer n such that V is 
—Z—invariant. 

n 

The d-dimensional case is consider in [S]. There, a characterization of the extra invariance of V when 
M is not all R d is obtained. Given M a closed subgroup of containing 7L d and M* (x, m) £ 

Z Vra £ M}, the authors construct a special partition {B a } a& jy of R d , where each B a is an M*— periodic 
set and the index set Af is a section of the quotient Z d /M*. More precisely, for each a £ Af, 

(5) B a =n + a + M*={J (H + cr)+ro*, 

m*GM* 

where is a section of the quotient R d /Z d . We refer to [9j for more details. 

Using this partition, for each a £ Af, they define the subspaces associated to a given SIS V 

(6) V a = {f£ L 2 (R d ): / = X bJ, with g £ V}. 

Given / £ L 2 (R d ) define for a £ Af, the function f a by f a = fxB a - 

The authors give a characterization of the M —invariance of V in terms of the subspaces V„. More 
specifically they prove that 

Theorem 2.10. If V C L 2 (R d ) is a SIS and M is a closed subgroup of M. d containing Z d , then the 
following are equivalent. 

(i) V is M—invariant, 

(ii) V a C V for all a £ Af, 

(in) Jv^ipj) C Jy(w) for almost every co and each a £ Af, 

(iv) ifV = S( ( h) then Ttp a (u]) £ Jy(ui) a.e. lj £ U for all ip £ $ and all a £ Af. 

3. Optimality for the class of SIS with extra-invariance 

Here we consider the approximation problem for the class of finitely generated SIS with extra invariance 
under a given proper subgroup M of R d . 

Let us start introducing some notation. Let m, l £ N, M be a closed proper subgroup of R d containing 
Z d and T = {/i,..., f m } C L 2 (R d ). Define 

(7) Vm = {V : V is a SIS of length at most I and V is M-invariant}. 

Let Af = {or,..., a K } be a section of the quotient Z d /M* and {B a : a £ Af} the partition defined in 

©• 

For each a £ Af, we consider T a = {/f,..., C L 2 (R d ) where, fj is such that f? = fjXB a for 

3 = Also, let F = {/f 1 , •. •, /£,. }. 

For each w £ U let G^(u>) be the associated Gramian matrix of the vectors in T with eigenvalues 

Ai(w) > • • • > A mK (w) > 0. 

Using Lemma \2 .21 we have that these eigenvalues are measurable functions. 

Since f°“ is orthogonal to ff l if s ^ t, the Gramian G^(u) is a diagonal block matrix with blocks 
Gcr(ui), a £ Af. Here G CT (w) is the m x m Gramian associated to the data . On the other hand, using 
Lemma [2~21 we have that 

G a (w) = U a (uj)A a (uj)U*(io) a.e. uj £U 

where U a are unitary and A „(uj) := diag(AJ(w),..., A^(w)) £ C mxm and they are also measurable 
matrices as in Lemma [2721 We also have Af (w) > • • • > A^(w) for each a £ Af. 

Using the decompositions of the blocks G a we have that 

(8) Gj=(w) = U{uj)A{u})U*{oj) 

where U has blocks U a in the diagonal, and A is diagonal with blocks A CT . We want to recall here that for 
almost each u> the matrix A(w) collects all the eigenvalues of the Gramian Gp{af) and the columns of the 
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matrix U(oj) are the associated left eigenvectors. Note that an eigenvector associated to the eigenvalue 
\j (w) has all the components not corresponding to the block a equal to zero. 

Now for each fixed w £ M, we consider {(U(w), ji(w)),..., (i n (ui),j n (uj))} with i s {ui) G A f and j s (u> ) G 
m} and n = mu such that 


»lM -v. •*> \*»M 

AM - - A j»M 


> 0 


are the ordered eigenvalues of G-p(ui), with corresponding left eigenvectors G C", for s = 

1) •' ‘ ,n- 

Here i s (u>) indicates the block of the matrix G^(w) in which the eigenvalue Ais found and } s (w) 
indicates the displacement in this block of the matrix More precisely, we have that A* s ^(w) 

coincides with A^to^^+^toAu;), the — l)m + th eigenvalue of Gp(uj). When ui G U is 

fixed, we will write i s instead of i s (u) and j s instead of j s (u>). 

We will prove now that 7 s (w) := is measurable as a function on w for each s = 1, • • • , n. 

J s(uj) 

Let sG{l,...,n} fixed. Let i s (u) G J\f and j s (w) G {1,..., to}. We have that 7 s (w) = AJ (w) for all 
w G E aj := {w£W: i s (u) = cr,j 3 (u) = j}- 
We observe that 


E a j = (w G U: i s (uj) = a,j s (u) = j} = {w G U\ A s (w) = AJ(w)}. 

Using Lemma f2~2l applied to G^(co) and G a {uf), we have that A s and AJ are measurable functions of ui. 
Therefore E a j are measurable sets. 

We further observe that 7 s (u>) = AJ(w), for w G E a j. So 7 s (w) is a measurable function. A similar 
argument shows that the eigenvectors are measurable. 

Finally we define h s : R d —> C, for s = 1,..., l 

m 

(9) '■= M E vfc-Vm+kMfk ( w )> 

fc=1 

where d) s (uj) = (Af (a;))" 1 / 2 if A* s (w) ^ 0 and 0V (w) = 0 otherwise. 

J s J s J s J s 

Now we are ready to state the main result of this section. 

Theorem 3.1. Let m,£ G N, and M be a closed proper subgroup of R d containing Z d . Assume that 
J- = {fi, ■ ■ ■, fm} C L 2 (M. d ) is given data and let be the class defined in (f7j). Then, there exists a 
shift invariant space V* G V l M such that 

m 

(10) V* = argmin'Y.Wfj - Pvfjf- 

v ^i, j=l 

Furthermore, with the above notation, 

(1) The eigenvalues {AJ(w) : a G M,j = 1,... ,m}, are h d —periodic, measurable functions in L 2 {U) 
and the error of approximation is 

m „ mn 

T(T, M, t) := Y, II fj - Pv* fi II 12 = / E X l M 

j'=l Ju s=l +1 

(2) The functions {hi ,..., he} defined in f9} are in L 2 ( R d ) and if ipi,... ,ipe are defined byfpj= hj, 
then = {ipi, (pe} is a generator set for the optimal subspace V* and the set — k),k G 
Z d , i = 1, • • • , £} is a Parseval frame for V*. 


Proof. Let V 1 be the class defined in Theorem [23 that is V £ is the set of all shift invariant spaces V that 
can be generated by £ or less generators. (Note that we do not ask the elements of the class V £ to have 
extra invariance.) 
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Define V* £ to be the optimal space given by Theorem [277] for the data T . That is, 

m m 

(11) EEn//^*//n 2 ^ EEiitf-^tff vvev*. 

j=l creAf j= 1 

We claim that V* £ V P M (in particular is M extra-invariant) and it is optimal in this class for the data 
P, i.e. 

m m 

(12) E II U - Pvfit < E ll/i - p vfi II 2 V V £ V e M . 

3 =1 3 =1 


Let us prove first that V* is M extra-invariant. For this we will check that the generators of V* 
satisfy condition (iv) in Theorem 12.101 We have from ([5J that the Gramian G^{oj) can be decomposed 
as Gp(uj) = U(uj)A(lj)U*(uj) with eigenvalues {AJ(w) : a £ Af,j = 1,... ,m}. 

By Theorem 12.71 the £ generators of V* have the form defined in m, 


(13) 


M E vlZ-im+k^fk M 

fc=i 


for s = 1,... ,£. 


From m it is clear that (p 8 is supported in Bi s , since each /£" is supported in Bi s . Then if we apply 
the cut off operator to these generators we obtain p° (w) = p s (uj) if a = i s (uj) and p° = 0 otherwise. So, 
in any case <p 8 £ V* for all a £ Af, s = 1,..., £ which proves the M --invariance of V*. 

What is left now is to prove that V* is optimal over the class V e M , that is V* satisfies equation (fT?l) . 

For this note that if V £ V then V = V a . So we have for any / £ L 2 (R d ), 

\\Pvf \\ 2 = ii Pv E m 2 = ii E Pyf a \\ 2 = ii E Pv-r ii 2 = E ii^vm 2 = E n^rii 2 , 

cr£Af cr(zAf cr GAT (j€lJsT <tGjV 

which implies together with m that 


En p ^/,ii 2 >Eii p ^ii 2 > v yeV M- 

3 =1 3 =1 

The others claims of the theorem are a direct consequence of Theorem 12.71 

□ 


4. Approximation with Paley-Wiener spaces 

4.1. Preliminaries. In this section the class of approximation subspaces will be finitely generated SIS 
that are translation invariant. That is translation invariant spaces that are generated by the integer 
translates of a finite number of functions. 

More precisely, given leN define T P to be the set of all shift invariant spaces V = S(<pi ,..., ipe) for 
some functions cpi,..., ipz in L 2 (R d ), and such that V is translation invariant and the integer translates 
of {</q,..., pi} form a Riesz basis of V. 

Given a set T = {/i,..., f m } C L 2 (R d ), we want to find V* £ such that 

m 

(14) V* = argminJ2 II Si - P vfj f- 

Here Py denotes the orthogonal projection on V. 

Before going to the approximation problem, we will obtain a characterization of the class 7” . 

Using Wiener’s theorem, we have that V is a translation invariant space in L 2 (R d ) if and only if there 
exists a measurable set fl C R d such that 

V={f£ L 2 { R d ) : /(w) = 0 a.e. w G R d \ fl}. 
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Since 12 is unique up to measure zero, we will write V = Vq. 

Definition 4.1. Let 12 C be measurable and L C be a countable set. We say that 12 tiles when 
translated by L at level £ £ N if 

xn(w — t) = £, for a.e. co £ M d . 

tei 

In case of L = Z d we will say that 1I is an £ multi-tile. 

It is known (see for example m,) that 12 is an £ multi-tile of K. d , if and only if, up to measure zero, 
12 is the union of £ measurable and disjoint 1 tile sets. i.e. 12 is a quasi-disjoint union of £ sets of 
representatives of R d /Z d . 

Lemma 4.2. A measurable set 12 C R d , t multi-tiles R. d if and only if 

12 = 121 U • • • U 12^ U N, 

where N is a zero measure set, and the sets Clj, 1 < j < £ are measurable, disjoint and each of them tiles 
R d by translations on Z d . 

The following proposition characterizes the set 12 for the elements in T l . 

Proposition 4.3. A subspace V is in l~ e if and only ifV = Vq with 12 a measurable £ multi-tile of R d . 

Proof. Assume first that V £ so V = Vq for some measurable 12 C R d . Also, as a consequence of 
Wiener’s theorem, for almost all to £ld we have Jy (w) = £ 2 (CU) with O u = {k £ Z d : w + k £ 12}. To see 
this, we note that Jy(w) C £ 2 (O w ). For the other inclusion, fix w £ U. Using that 12 = (J fcgZd E k where 
Ek = (U + k) fl 12, we have that k £ O w , if and only if w + k £ E k . Hence, if a £ £ 2 (O u ) consider the 
function G w (£) = a kXE,,{£,)■ Since G u is in L 2 (12), the function g defined by g = G u is in V, and 

g(ui + k) = a k if k £ O Therefore, g £ V and a = rg(ui) £ Jy(w). 

Now, since V = S(<pi ,..., ipf), and the integer translates of (pi, ■ ■ ■ ,<pi form a Riesz basis of V, using 
Lemma l2.6l we obtain that {r<^i(w),..., Tipifuj)} form a Riesz basis of Jy(w) with the same Riesz bounds 
for a.e. u £U. We conclude that dim(Jy(w)) = £ a. e. oj £U. 

Since V is translation invariant, by the observation above dim(Jy(w)) = ffO w . Then ffO u = £ for 
almost all oj £U, which implies that 12 is an l multi-tile. (Here jfA denote the cardinal of the set A). 

For the converse, assume that 12 is a measurable £ multi-tile of R d . Define V = Vq. So, V is translation 
invariant. 

By Lemma [4.21 we have that 12 = 121 U • • • U 12^ up to a measure zero set, where each 12^ is a set of 
representatives or R d /Z d . We define ipj by its Fourier transform: <pj = yjq ; j = 1,.. .,£. 

Since {e 2mulk (pj : k £ Z d } is an orthonormal basis of L 2 (£lj), we have that {e 27rlulk (pj : k £ Z d ,j = 
1, ■■■,£} is an orthonormal basis of L 2 (12), and so, {tkpj '■ k £ Z d ,j = 1, ...,£} is an orthonormal basis 
of V, in particular a Riesz basis. □ 

4.2. The approximation problem for Paley-Wiener Spaces. Now we come back to our approxi¬ 
mation problem. In order to find an optimal subspace in the class T e for a set of data T = {/i,..., f m }, 
it is enough to find the associated £ multi-tile 12 in R d . It is not difficult to see that if we allow 12 to be 
any £ multi-tile the minimum in (11411 may not exist. So we will restrict 12 to be inside a cube that could 
be arbitrarily large. Let us fix N £ N. Define 

C N := [-(iV+l/2),iV + i/2] d , 

Mfi := {12 C C/v : 12 is measurable and £ multi-tiles R d } and 
T& := {V £ T e : V = Va with 12 e M e N }. 

With this notation we can state the main result of this section. 
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Theorem 4.4. Assume that m,t £ N and a set T = {/i,..., f m } C L 2 (U. d ), are given. Then for each 
N > t there exists a Paley- Wiener space V* £ T~^ that satisfies 

m 

(15) V* = argmin^Wfj - P v fj\\ 2 , 

ve Tf j=1 

where is the class defined above. 

Proof. First we observe that if a solution space V* exists then 


(16) V* = argmin^Wfj - P v fj\\ 2 = argmax'Y^\\P v fj \\ 2 , 

v&U j=1 vaTf J=1 

and using the definition of we have that 


(17) 


max 

vaU 


En p ^iii 2 = 


i=i 


= max 


Ei 

3 = 1 


PvJj 


So, we need to find SI £ M^ that yields the maximum in Gilt- 
Using Lemma [2751 we see that for each fl £ M^, 


(18) 


Eii p ^-ii 2 = Eii p vs£ii 2 

3 =1 3=1 



Recall that Pj Vn (w) denotes the orthogonal projection onto the closed subspace Jvh(w) of £ 2 {Z d ). 

Furthermore, if SI £ Mir , we know from the proof of Proposition 14.31 that dim( Jy n (w)) = £ for a.e. 
uj £ IA. Note that Jy n (w) agrees with the subspace of £ 2 (Z d ) of the sequences supported in O u . Then there 
exists a unique set of i integer vectors k n (w) = {fcp(w),... ,kf(uj)} C Z d such that span{<5 fc o( w ) : j = 

1 ,...,£} = Jvh(w), for a.e. uj £ U. Here Sj denotes the canonical vector in t 2 (Z d ). i.e. Sj(s) = 0 if s ^ j 
and 1 otherwise. Note that , since Cl C Cn necessarely \\k^ (w)lloo < IV, for each j and uj. Combining this 
observation with (1T5T) we obtain, 

m p m i 

(19) En p ^/iii 2 = / EEi£( w+fc ?M)i 2 ^- 

j —i J IA j—\ i 

So, now we need to maximize the left hand side in (fTTJl) over all the sets fl £ Mjq. 

Note that given S 1 £ Mfj , for almost each uj £ U, the set SI contains exactly £ elements from the sequence 
{uj + k,k £ Z d }. Then we can pick for each uj £ U (up to a set of zero measure) £ translations k*(u >) 
such that Z'jLi X) s =i l/i( w + %(w ))| 2 is maximum over all sets of £ translations k = {%,..., %} C Z d , 
with ||%Hoc < N. The maximum exists since the fibers of fj are S? 2 (Z d )-sequences and the number of 
translations considered is finite. 

Call 1C the set of admisibles translations i.e. K, = {k = {%,..., %} C Z d : ||%Hoc < N} and for k £ 1C 
set H k (w) = ZJLi Eh \?A U + W)l 2 - 

Our goal is to construct a set fl such that the associated space Vn is optimal. So the idea is to construct 
the optimal set 0* considering for each w £ U the optimal translations {w + k*(uj) : s = 1,... ,£}, and 
then taking the union over almost all uj £ IA. 
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For this we define for each k = {ki,..., k^} £ 1C the following subset of U, 

£ k = {weW: #icM > H r (w), Vr = {n,. ..,n} e JC}, 

i.e., is the set of oj £ U for which the maximum is attained for k = {fci,..., kc}. Note that E k could 
be the empty set for some k = {fci,..., he} and the sets E k may not be disjoint. 

Finally we define our optimal set as, 

t 

O* = [J U E * + kj- 

ke/c 7=1 


We will now prove that 17* is measurable. First we note that E k is a measurable set for each k £ /C 
since, 

^=n 

re/c 

where, 

F r k = {wGW: H k (u) > H r (u)} . 

Now, since F ^ is measurable for all r £ /C, we obtain that E k is measurable and so is 17*. 

Furthermore, by construction, 17* is in M l N . Since for all 17 £ M^ we have that, 

m £ mi 

| fj(cj + k*(co))\ 2 for almost all uj GU, 

j=l S = 1 j —1 S= 1 

taking the integral over U we get 


m n, m £ 

\\ P Vnfj\\ 2 = / + k s(u))\ 2 

J=1 •'W -7=1 s== l 

£ 




7=1 s=l 


7 = 1 


This shows that 17* £ M^ is optimal over all 17 £ M^. We conclude 
the data F. 


that Vh* £ 7]y is a solution for 

□ 


Remark 4.5. Notice that if 17^ is the optimal multi-tile set for the class 7jy for some data F, then the 
approximation error is given by 

„ m r. m „ m 

£ n (F,£)= ^ j \f j {u)\ 2 du= l ^|^(w)| 2 dw+ / X! |/jM| 2 du;. 

j=\ ^Cn\Q*n j = i «/R d \CN j = i 

Clearly £n{F, £) > £at+i(.F, ■£). So £{T,£) := limjv-*x) £n{F, f) is somehow the optimal error. Since 
F" C L 2 (M. d ) then the second integral goes to zero when N goes to infinite, for functions with good decay 
at infinite we will be close to the optimal error for conveniently large N. 

Remark 4.6. In Proposition ^. 31 we show, for an element of T , how to construct a set of generators that 
gives a Riesz basis of translates in 7L d . There are many ways to construct other sets of generators that 
gives Riesz basis of translates. Recently Grepstad-Lev in ng constructed a basis of exponentials for 
L 2 ( 17) when 17 C is a multi-tile. Later on, Kolountzakis [21] gave a simpler proof of this result in a 
slightly more general form. Precisely they prove the following result. 

Theorem 4.7 (Theorem l.BTj). Suppose 17 C is bounded, measurable and multi-tiles R d when translated 
by Z d at level £. Then there exist vectors a \,..., at £ R d such that the exponentials 

e -2 Ki(a j+ k)u j = k( zjd 


form a Riesz basis for L 2 {Tl). 
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From Theorem 14.71 we can obtain immediately a set of generators for Vn- Let (p be such that (p = \n- 
If a ±,..., ai G are as in Theorem 14.71 then Vo = S(ip \,..., (pi) with <pj = t aj ip, j = 1,, £, and the 
translates of (p±,... ,ipe form a Riesz basis of Th¬ 
in general, all the Riesz basis for Vh can be described in the following way: 

Let A = {oj S } G [L 2 (U)Y xe be a measurable matrix, such that 0 < c\ < A (to) < C2 for every eigenvalue 
A(w) and for almost each to & U. Set k(to) = (k\(to ),..., k(,(to)) such that w + k s (to) G f2. Define p>j such 
that ipj(to + k s (w)) = aj S {to). Using the results stated in subsection 12.11 it is not difficult to see that 
(pi ,... ,ipe are measurable, Vq = S(ip ±,..., (pi) and the translates of <pi, ■ ■ ■ ,<pe form a Riesz basis of Vh- 

5. The discrete case 

The optimal subspace V* in Theorem 13.11 is the closest to the data T over all subspaces V in the 
class Vm- It is n °t difficult to see that almost each fiber space Jy*(to) C £ 2 (Z d ) of V* is the closest to 
the fibers of our data, r(J r )(w) = (r/i(w),..., r/ m (w)} over a certain class of closed subspaces of £ 2 (Z d ) 
that we will call V e hl -. Clearly this class of subspaces is determine by the class V^. So, Theorem 13.11 
implies an approximation result in £ 2 (Z d ), for a very particular class determined by the extra invariance. 
Therefore it is interesting to see if this approximation result in £ 2 (Z d ), extends to more general classes. 
We will define in what follows a very general class 2?^-. The cases coming from the continuous case will 
be particular cases of our general definition. The proof of the optimality that we obtain is more general 
and can not follows from Theorem 13. II 

In what follows, the Hilbert space we consider is £ 2 (Z d ). We define the class of approximating subspaces 
in the following way: 

Let A f be an arbitrary finite set and {D a : a e JV} a partition of Z d , that is, Z d = U^eA rD<r, where 
the union is disjoint. 

For a G £ 2 (Z d ), we denote a a = lD„a, where Id, denotes the indicator of D a . Given S C £ 2 (Z d ) a 
closed subspace we define 

So- = {a CT : a G 5}, for each cr G TV". 

We define the class of approximating subspaces by 

(20) = {5 C t 2 (Z d ): 5 is a subspace, dim(5) < l and S a C 5, Ver G TV"}. 

Note that 5 G if and only if dim(5) < £ and 5 is the orthogonal sum of the subspaces S, 7 i. e., 

5 = © 0 . e A/'5 CT . 

For a given set A = {ai,..., a m } C £ 2 (Z d ) consider for each, a G TV", the Gramian matrix G a G C mxm 
of the data A a = {aj,..., a^}, that is ( G a )k,i = (a£, af ,), k, l = 1,..., m, with eigenvalues Af > ..., > 
A^, and orthonormal corresponding left eigenvectors yf,..., y^. 

Now set A = {AJ : j = 1,..., m, a G TV"} and collect in A( the £ first biggest eigenvalues of A that is 
if A G Ag then A > p for all p G A \ A^. 

Write A g = {Ai,..., Af}. For each s = we define the sequence q s G £ 2 {Z d ) in the following 

way: 

Since A s = for some a s G TV" and some j s = 1,..., m, then A s is an eigenvalue of G„ a . Let y Jj* be 
the corresponding left eigenvector y°’ = (yj* (1),..., yJ* (to)). 

Then define, if A s G Af, A s ^ 0 

m 

(21) q'~(K)- l/2 'ZVi:W a k- 

fc =i 

If A s = 0 we define q s to be the zero sequence. 

With this notation we can state the main theorem of this section: 

Theorem 5.1. Let to, £ G N and TV" a finite set. Assume that a set A = {ai,..., a m } C £ 2 (Z d ) is given. 
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Then there exists S* £ V^ that satisfies 

m m 

(22) ^||a J -Ps.a i || 2 <^||a j -Ps%|| 2 , VP e V^. 

i=i i=i 

Moreover, we have that 

(1) S* = span{q\, ■ • ■ ,qf\ where qi,--- ,qt are defined in (I2T1) . Also, the vectors {qi,--- ,qe} form a 
Parseval frame for S*. 

(2) The error in the approximation is 

£(A,Af,£) = E x - 

AeA\Af 

Proof. First, we observe that (B^l) is equivalently to, 

m m 

Ell p s*%-|| 2 >Ell p ^H 2 ’ 

3=i j=i 

Furthermore, if S £ then 

mm m m 

E n p *M 2 = E ii E p ^-n 2 = E ii E p sX ii 2 =EE n p ^«jn 2 . 

3=1 3=1 treW 3=1 o-6A^ <r€Af j=l 

where aj = J2aeAT a j • 

In order to construct an optimal subspace P* G for the data .4, we will find an optimal subspace 
P CT for each a £ Af, of dimension at most a a £ {1, ...,£} for the data A a = {aj, ..., af,}. The existence 
of the optimal subspaces are provided by Theorem 4.1 of j5]. We need di m (P<r) = Yla&M a ° = 

dim(P*) <1. 

Thus if Q = {a = {a a } : 0 < a a < £ and acr — then f° r eac h choice of a £ Q we will find 

optimal subspaces {P“ : a £ Af} and define P“ = © CTg 7 vP“. 

The candidate for P* is the space P“ which minimize the expression E^l) over all a £ Q. Let fj £ Q 
be the minimizer. Hence /3 satisfies, 


(23) ££» p ««jl| 2 <£ Ells’ll 2 Vaec. 

cr£j\f j =1 <j£j\f j =1 

Therefore, the subspace S* := = © is the optimal subspace we need. It is straightforward 

to see that P* £ and that P* is optimal. 

Using Theorem 4.1 of [2] we obtain that for each a £ Q, the error of approximation for the data A a 
and the class of subspaces of dimension at most a a is given by 

m 

£(A a ,a„) = E A «- 

S = Q' ct + 1 

So the distance between the a-optimal subspace P“ and the data A is, 

m 

(24) £(a) = E = E E A «- 

cr£Af cr€:J\f s=ot . a +1 

Let hi be the number of elements in AT. We see that E(a) is minimum when the mu — i eigenvalues 
used in (IM1) are the smallest from the set A = {AJ : j = 1,..., m, cr £ Af}. Therefore if we set A<cA 
the set of the £ biggest eigenvalues from A, the optimal /3 = {fi a } £ Q satisfies that 

|J{AJ,...,A^} = A f . 

crEA/” 
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Therefore, 

m m 

£(aa/^) = E^-^) = EEK- p sKii 2= E E a *= E a - 

<tGA/" ctGA/*j = 1 <tGA/* ■s = /3 0 --|-1 AgA\A^ 

In order to construct the generators of S@ it is enough to construct the generators of each S%. Since 
are optimal subspaces for the data A a according with Theorem 4.1 of [2] the generators of are 
given by ED- That is the set {q s : <j s = a} is a Parseval frame of S%. 

Since the subspaces are mutually orthogonal, {qi ,..., qe} is a set of Parseval frame generators for 
the optimal space S* = . □ 

Remark 5.2. As explained at the beginning of this section there is a reason to consider this particular 
class of subspaces for the the discrete case. If a SIS V is M extra-invariant for some proper subgroup M 
of containing Z d , then its fiber spaces Jy(w) satisfies exactly the conditions that we imposed on the 
class V where the partition of Z d is {B a n Z d : a £ Af} and B a and A f are as in ©■ So, the discrete 
result (Theorem 15.11) provides a different proof of Theorem 13.11 using properties of range functions (12.31) , 
without the need of Theorem 12.71 Actually this proof includes Theorem 12.71 
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